Graph Transduction Learning of Object Proposals for Video Object Segmentation

Authors

  • Tinghuai Wang
  • Huiling Wang
Abstract

We propose an unsupervised video object segmentation algorithm that detects recurring objects and learns cohort object proposals over space-time. Our core contribution is a graph transduction process that learns object proposals densely over space-time, exploiting both the appearance models learned from rudimentary detections of sparse object-like regions and the intrinsic structures of those regions. Our approach exploits the fact that rudimentary detections of recurring objects in video, despite appearance variation and sporadic detection, collectively describe the primary object. By learning a holistic model from a small set of object-like regions, we propagate this prior knowledge of the recurring primary object to the rest of the video to generate a diverse set of object proposals in all frames, incorporating both spatial and temporal cues. This set of rich descriptions underpins an object segmentation method that is robust to changes in appearance, shape and occlusion in natural videos.
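
The abstract does not give the formulation of the graph transduction step, so the following is only a minimal sketch of the general idea, assuming a standard label-propagation scheme on a fully connected region graph with Gaussian affinities. It is not the authors' method. The function name `propagate_scores`, the parameters `sigma`, `alpha` and `iterations`, and the toy feature vectors are all hypothetical, chosen for illustration. The sketch shows how confidence attached to a few detected object-like regions can spread to undetected regions that look similar.

```python
import math


def propagate_scores(features, seed_scores, sigma=1.0, alpha=0.9, iterations=200):
    """Spread seed scores over a similarity graph (generic label propagation).

    features    : list of equal-length feature tuples, one per region (assumed input)
    seed_scores : initial objectness per region, e.g. 1.0 for a detection, else 0.0
    """
    n = len(features)

    # Gaussian affinity between every pair of regions; no self-loops.
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                sq_dist = sum((a - b) ** 2 for a, b in zip(features[i], features[j]))
                W[i][j] = math.exp(-sq_dist / (2.0 * sigma ** 2))

    # Symmetric normalisation S = D^{-1/2} W D^{-1/2}.
    inv_sqrt_deg = [1.0 / math.sqrt(max(sum(row), 1e-12)) for row in W]
    S = [[W[i][j] * inv_sqrt_deg[i] * inv_sqrt_deg[j] for j in range(n)]
         for i in range(n)]

    # Iterate F <- alpha * S * F + (1 - alpha) * Y, where Y holds the seed scores.
    scores = list(seed_scores)
    for _ in range(iterations):
        scores = [
            alpha * sum(S[i][j] * scores[j] for j in range(n))
            + (1.0 - alpha) * seed_scores[i]
            for i in range(n)
        ]
    return scores


# Toy example: regions 0 and 1 look alike, regions 2 and 3 look different.
# Only region 0 is a seed detection.
toy_features = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0)]
toy_scores = propagate_scores(toy_features, [1.0, 0.0, 0.0, 0.0])
```

In this toy setup the undetected region 1 inherits a high score from its near-identical neighbour, while the dissimilar regions 2 and 3 stay close to zero. A real system would build the graph from appearance and space-time adjacency cues rather than from raw feature distance alone.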

Similar resources

Unsupervised Graph Based Video Object Extraction

A method to extract the object containing regions in the ‘object proposal’ set in the video is employed. The segmented object containing areas are then employed to construct segmentation models for optimal video object extraction. First, an unsupervised graph based framework is used for detection and extraction of foreground in the video. We take into account the general properties (spatially c...


Segmentation Assisted Object Distinction for Direct Volume Rendering

Ray Casting is a direct volume rendering technique for visualizing 3D arrays of sampled data. It has vital applications in medical and biological imaging. Nevertheless, it is inherently open to cluttered classification results. It suffers from overlapping transfer function values and lacks a sufficiently powerful voxel parsing mechanism for object distinction. In this work, we are proposing an ...


Object-Oriented Method for Automatic Extraction of Road from High Resolution Satellite Images

As the information carried in a high spatial resolution image is not represented by single pixels but by meaningful image objects, which include the association of multiple pixels and their mutual relations, the object based method has become one of the most commonly used strategies for the processing of high resolution imagery. This processing comprises two fundamental and critical steps towar...


Cross-Granularity Graph Inference for Semantic Video Object Segmentation

We address semantic video object segmentation via a novel cross-granularity hierarchical graphical model that integrates tracklet and object proposal reasoning with superpixel labeling. A tracklet characterizes the varying spatial-temporal relations of a video object but quite often suffers from sporadic local outliers. In order to acquire high-quality tracklets, we propose a transductive infer...


An End-to-end 3D Convolutional Neural Network for Action Detection and Segmentation in Videos

Deep learning has been demonstrated to achieve excellent results for image classification and object detection. However, the impact of deep learning on video analysis (e.g. action detection and recognition) has not been that significant due to complexity of video data and lack of annotations. In addition, training deep neural networks on large scale video datasets is extremely computationally e...


Publication date: 2014